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IDENTIPIEES 
ABSTRACT 

Teacher dissatisfaction with the existing math 
testing program in the Hinneapolis Public Schools "(HPS) led to the 
estkblishpent of the Committee on Citywide Hathematics Testing* The 
ct>m littee was formed to identify tests to be used, grade levels to be 
tesied, and testing schedules. The comnyittee recommended that tests 
be given in the spring inste§^ of the fallr as before, that tests be 
givien in grades^ 3, 6,* and 8, and that a pilot study be conducted by 
the* Research and Evaluation Department to select one test for each 
grade from among those tests which had passed the initial screening • 
Five tests were u^ider consideratiq;ii at the third grade level (the 
SBA, the CAT, the Stanford, and the Metropolitan Form F and Primary 
JI), four at the sixth grade level (the SEA, the CAT, the Stanford, 
and the Hetropolitan Form F),.and three tests at the eighth grade 
level (the SEA^ the CAT, and the Stanford) . The Stanford series was 
eliminated because testing materials could not be obtained in time 
for the pilot test. Each of the tests was given in three classrooms. 
Each teacher administering one of the tests responded to a 
questionnaire on the test used. Similar ratings were obtained on all 
the tests for: ' face validity, reading difficulty, cultural bias, 
pupil motivation, administration instructions, and interpretation 
instructions. Based on the limited evidence from this pilot tests, it 
was recommended that the SEA be selected. (Author/EC) 
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Selection of a Mathematics Test 
for a Citywide Testing Program 

Summary 

See 

Teacher dissatisfaction with the existing math testing program 
in the Minneapolis Public Schools (MPS)' led to the estabUshment 
of the Comittee on Citywide Mathematics Testing. The Copimittee ^ 
was formed in December 1972 to identify tests to be used, .grade 
levels to be tested^ and testing schedules. . ' 

After examining a nuatoer of tests, the committee recooa^nded 
that tests be given in the spri^ instead of the fall, as before, 
that tests be given in grades 3, 6, and 8, and that a pilot study 
be. conducted by the Research and Evaluation Department in June 1973 1 
to select one test for each grade from among those tests which had 
passed the .iniljial screening. Five tests were under consideration at 
the third grade level (the SRA, the CAT, the Stanford, and the Metro- 
politan Form- F and Primary II) four at 'the sixth grade (the SRA, the 
CAT, the Stanford, and the Metropolitan Form F), and three tests at 
the eighth, grade level (the SRAj the CAT, and the Stanford). The 
Stanford series was eliminated because testing materials could not 
be obtained in time for the pilot test. 

Each of the tests was given in- three classrooms which were- 
randomly selected within schools ^ich were themselves randomly 
selected from high, middle, and low income schools ."^ The pilot * h 
study involved 590 students and 25 teachers (two of the 27 teachers 
scheduled to participate did not take paart). Each teacher administering 
one of the tests responded to a qxiestionnaire on the test txsed. 
Sampling limitations and the small nuniber of teachers involved dictated 
that the results of this stufjy be considered suggestive rather than 
conclusive. 

With one minor exception*, pupils from high income schools 
•scored highest and pupils from low income schools scored lowest 
regardless of thfe .test used. Errors in test scoring were made 6 
by all tsachers involved in the pilot t^st. ih 

Teachers' ratings did not vary substantially for most tests ^ 
characteristics. Fx>t practical purposes, similar ratings were obtained v, 
on all the tests for: face validity, reading dlffictaty, cultural 
bias, pupil motivation, adndLnistration instructions, and interpre- 
tation instructions. The CAT and the SRA were, however,, the most 
favorably received, overall. The SRA yielded median percentile 6 
ranks which most 'consistently paralleled publishers' norm medians. 

Noting the^ desirability of tising the same test^ (different A 
forms) at all three grade levels, it wks recommended that if a ^3 
decision* can be delayed fdr another year, further study of the 
SRA, the CAT, and the Stanford shojald be mde. If a decision must 
be made now, based on the limited evidence from this pilot test, 
it was recommended that the SRA be selecfed. 
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Minneapolis Public Schools 

^ Selection of a Mathematics Test 
for a Citywide Testing Progrcto* 

In Decenier 1972 the Committee on Citywide Mathenetics Testing^ 
. was established by th^^nneapolis Public Schools^ to review the mathematics 
component of the citywide testing .program* Teacher dissatisfaction with 
the existing program> particularly disQatisfactic«i with the use of *tlie 
Iowa Test of Basic Skills Modem Mathematics Supplement in inner-city 
schools, was the impetus f or ^he establishment of this committee. 
Identifying tests to be used, grade levels at which tests were to be 
given, and testing times were the specific charges to the committee • 

After examining a number of tests, the fourteen member committee 
recommended that the new tests be given in this spring-- rather than in 
the fall which had been the custom— and that a pilot study be conducted 
in June 1973 to select one test from the five which had passed the 
initial screening. Grades 3> 6 and 8 were, to be tested* 

In March 1973 > the Research and Evaluation Department (R & e) was 
asked to conduct a pilot study of the following tests: 

Grade 3 

SRA Achievement Series, 1972, Primary Level II 
^California Achievement Test, 1970, Level 2 ' 
Metropolitan Achievement, 1970, Priiaaiy II 
Metropolitan Achievement, 1970, Elementary Battery, Form F 
Stanford Achievement 'Series, 1972, Primary Level II 



'Citywide Mathematics Testing Commi'ttfee members: Ed^Anderjs^n, Don Borgeson, 
Charles Dorniden, Kathy Feller, Ruth HarW Art Indelicate, "Randy Johjiison, 
George Keprios, Mary Lou Knipe, Elmer K^, Dennis Lander, Sally Sl($8^n, 
Mike SUndberg, Ross Taylor • ' 
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grade 6 

SBA Achievenent Series, 1972, Oreen Level,*" Form E 
California* Achievement Test, 1970, Level 3 
Metropolitan Achieveaent, 1970, Intermediate, Form F 
Stanford Achievement Series, 1972, Intermediate Level II 

? grade 8 

SBA At^hievement Series, Red Level, Form E 
California Achievement Test, 1970, Level k 
St€uiford Achievement Series, 1972, Advanced Battery 

Initial plans were to cont3?act an Independent reseeirch organization 
to conduct a pilot test focussing cm teacher reactions, opinions of 
mathanatics department chairmen, a review of technical literature on 
the test, and test administration to a sample of students • These plans 
were discarded bqcaxise of a lack of fxmds. 

The actual study, conducted by E Department personnel, provided 
only 'a partial answer to the questions asked. Tests were administel'ed 
to a sample of students, and teachers administering these tests responded 
to a questionnaire on the tests they had used. Sampling limitations 
and the small number of teachers involved dictate that the results of. 
this study be considered suggestiire rather than conclusive. 

The Pilot Test 

Testing * , . , 

Each of the proposed tests was given in three classrooms: one 
classroom in a high socioeconomic school, one classroom, in a low socio- 
economic school and the third classroom in a middle income school. 

i 

2 

6 



The Stanford series was not used because ^testing materials could not 
be obtained in time ^or the June 1973 pilot test. . 

Schools were selected reuadomly from high, middle, and low-income 
ranges of tte Minneapolis Public Schools Management Index for 1972. 
.This Index is a management tool which provides a crude measure of 
socioeconcMnic status. Classrooms were selected randomly within each 
of the schools. 

Table 1 lists the, schools involved. Table 2 gives the number of 
students who^completed the tests. Two classrooms were not tested due 
to a mix-up in a mailing of test materials. These classrooms were the 
third grade, low SE school for the California Achievement Test and 
the sixth grade iow SE school for the Metropolitan-F Test. General 
test administration instructions are given in Appendix A. 

Tests were hand scored by the teachers involved in the pilot test. 
Scoring was checked in the Research and Evaluation Department and | 
discrepancies between teacher scoring and the R and E scores were 
reconciled by a third person. 

Median percentile ranks for each classroom were computed by converting 
each student's raw score to a percentile rank score and then taking a 
median for the class. A similar procedure was used to obtain an overall 
percentile rank on each ^est for the three classes combined from the 
various socioeconomic levels. Appropriate publisher's norms were used 
for each test. , - 

Teacher Reactions 

Teacher perceptions were obtained from a questionnaire which asked 

about face validity, reading difficulty, cultural bias, ease of adminis- 
tration, ease of scoring, ease of interpretation and results, and student 
reactions to the tests. This information was obtained after the teacher 
had administered the test, scored the answer sheets, and interpreted the 
test results • Appendix B provides a sanqple of the questionnaire sent to 
the teachers. 
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•Table 1 



' Test 

SRA 
CAT 
MET TI 
MET F 



SRA 
CAT 
MET-F 



Schools Involved in a Pilot Test 
of Proposed Mathematics Tests 
June 1973 



High SES 

Armatage 
Wenonah 
Azmtage 
Keewaydin 



Audubon 
Audubon 
Keewaydin 



Grade 3 



Middle SES 

Shingle Creek 
Putnam * 
Holland 
Shingle Creek 



Grade 6 



Hamilton 
Shingle Creek 
Shingle Creek 



Lov SES 

Harrison 
Harri-son 
Hay 

Harrison 



Hay 
Hay 

lyndale 



Grade 8 



SBA 
CAT 



Anthony 
Southwest 



Ramsey 
Sheridan 



Lincoln 
Bryant 
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Resvilts ♦ 

!DBLble 3 gives median percentile ranks for each test at each grade 
level for th^ coobined sauple of students from middle, low, and high 
socioeconomic schools* 

On the surface, the SBA appears to have yielded Biedian percentile 
ranks vhic)i most consistently paralleled publisher's norms. Median 
percentile ranks vere 58, 50, aUd 55 for grade? 3, 6, and 8 jrespectivel^* 

Median percentile ranks for the CAT ranged from 38 to 63* This 
larger range may have bee^ due to sampling bias, since the absence of 
Icir^incoibe pupils in the 'third grade sample resulted in a sample of 

pupils from middle and l^igh income schools* Such a biased sample would 

/ 

produce spuariously high average scores if test scores are correlated 

/ 

with sbcioeconomic leve'l* 



. ^ liespite a simila^ bias in sampling for the MET-F, the expected high 

median percentile ra^k did not occur* Sixth grade pupils who took MBT-'F 

/ 

ranked at the U6th percentile while the third grade pupil sample, which 
included low income students, ranked at the 23rd percentile* 

Third grade pupils who took the MET-2 had a median percentile rank 
of 65. 

Besu3/t6 shbwing the median percentile rank for pupils from high, 
middle and low socioeconomic schools are shown in exhibits 1, 2 and 3* 
With one exception, pupils fjrom high income schools scored highest and 
pupils^ from low income schools scored lowest-*-3regardless of the test 
used. Only with the MET-F was there a minor deviation from this general 
pattern* At the third gra^e level, pupils from the middle income schools 
scored slightly higher on MET-F than did pupils from the high income 
school* Pupils ft:om the low income a^chool scored substantially below 
the other two income groups ' however ♦ , . 
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Table 3 



Median Percentile Ranks for Minneapolis 
Papils Based on Publisher's Norms 
June 1573 



Test . 




Grade 
6 


8 




SRA 






50 


55 




CAT 






50 ' 


38 




MET II 




65 








MET P 




23 


^ 1.6^ 







Lovr SES sample of students not included* 

Third grade CAT"38 students; sixth grade MET FS"66 students. 

Other tests ranged from 63 to 82 students at each grade level* 
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Teacher Ratings 

Exhibit k summarizes the ratings of various test characteristics 
made by the teachers participating in the pilot study. A tnore complete 
picture of the exhibit entries may be obtained by referring to the 
teacher guestionaaire in Appendix B. * 

Note that only two or three teachers are involved in making these 
ratings for each test at each grade level. Exhibit h attempts to present 
a Qpmiposite picture of these ratings. ^ 

Teacher's ratings did not vary substantially for most test characteristics 
For practical purposes, similar ratings were obtained on the four tests 
for: face validity, reading difficllity, cultural bias, pupil motivation, 
administration instructions, and interpretation i|istructions. 

Ratings on a few characteristics did differ among the four tests. 

SRA ; No "unusual" ratings* 

CAT : Some question about the adequacy of practice problems was 
raised. Hand scoring procedures were rated poor by eighth grade teachers 
but this may have been due to the fact that the appropriate scoring template . 
was not available. Fewer teachers found it necessary to deviate from 
administration instructions on the CAT than on the other tests. 

Met II ; Test norm information was rated poor. Hand scoring rated 
poor to adequate. Two of the three teachers trying this test found it 
necessary to deviate from the administration instructions. 

MET F t Practice problems were adequate, at best, according to 
teacher ratings. Three of five teachers deviated from administration 
instructions. A number of low income, third grade students were reported 
discouraged by the test. In addition students had- a hard time grasping 
the directions concerning NG'(not given) and DK (don't know). 
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Exhibit k' i ^ 

Soamzy of Teacher 'Batiogs^pf 
Variotis Aspects of the Tesis 



item Grade 


SRA 




MET II 


MET Fb 


.Face 3 
Validity g 

8 


Good 
Good 
Adequat^e to good 


AdeqTiate to poor 
Adequate to good 
Adequate to* good 


Good 


Adequate 
Good 


Heading 

Difficulty 3 
6 

8 


Easy for half 
to aost 

Easy for most 

Easy for half 
to most 


Easy for half 
to most 

Easy for ikost to 
all 

Easy for half 
to most 


Easy for half 
to most 


Easy for half 
to most 

Easy for most 
to all 


Cultural ~ 
Bias ^ 

. 6 
8 


None reported 
ff ff 

ft ti 


Kone reported 
ff tt 


None reported 


None reported 
ft If . 


Problens ^ 
6 

8 


Adequate 
^ Questionable 
Adequate 


Questionable 
Questioi^able 
Adequate 


Adequate 


Ix^truction not clear 

r 

Adequate 

•i 


Ptmi 1 

Motivation 

6 
8 


Most motivated 
Half to most 
Htflf to moat 


Most to all 
motivated 

Most to all 
Half to most 


Half to no6t 
motivated 


Half motivated 
Almost all 


"'Administration 
Instruction 3 

, 6 
8 


Good to excelleni 
Good 
Good • 


* 

Good 

Excellent 
Adequate to^ good 


Good 


Good to excellent 
Good to excellent 


Deviations 3 
^fron 

Administration ^ 
Instructions 

8 


!IVo of three 
' teachers 

fjone 
One of • three 


None 
None 

One of three 


• 

Two of three 
teachers 


One Off three 
One of two 


Itest 3 
Interpretation ^ 
Instructions ' ^ 
8 


Good to excellent 
Adequate to good 
, Good . 

> 


Good 
Good 
Adequate 


Adeqmte to good 

j 


Good ^ 
Adequate to good ' 



(cont^inued) 
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Sumnary of Teacher Ratings of Various Aspects of the Tests (continued) 





SBA 


CAT 


MET II 


MET F 


*ppfit Nora ^ 

Inforaation 

6 
8 


Good 
Adequate 

Good 


Good 
Adequate to good 
Adequate 


Poor 


Good 
Good 


Scoring 

6 
8 


Good 
Adeqtiate to good 
Adequate 


Adequate* 
Poor* 




«* Vlw^^ VIA WW ^S/VM& 

Adequate to good 


Students 3 
Discouraged 

6 
8 


A Few 

A Few 
Few or none 


A Few • 
Few or none 
Few or none 


^-•A Few' 

i 


A Few 
None 



^Proper scoring template not available; substitute used. 

^Only two teachers rated the third grade CAT and the sixth Grade MET 
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Test Scoring ♦ 

Errors in test scoring were made by 411 teachers involved ia the 
pilot test (N=25) . While Q% of all scores were within five percentile 
ranks of the correct rank, some errors were substantial. About one out 
of nine pupils (11^) received scores which were more than teij percentile 
ranks above or below their correct rank. 

All ptipils in one eighth grade class received teacher scores ^ich 
were higher than their correct scores. All third grade pupils in one class, 
except one, received scores lower than their correct scores. * 

One pupil whose correct percentile rank was l8 received a teacher 

^* i 

score of 62. Another pupil with a correct rank of 91 received a rank , 
of 1^1. 
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Discussion 



It should be clear that this rather cursory analysis of the various 
tests "provides evidence >rtiich permits us to make only an educated guess 
as to which one might be most appropriate for use in the Minneapolis Schools 

The most substantial evidence we have for the various tests is the . 
median percentile ranHs for the various samples of students on each of 
the tests. However, 4e cannot conclude that the cojiibination of the high, 
middle and lew incon^ samples used in this pilot study gives us an accurate 
reflection of the results we would obtain if the tests were used citywide. 
A more accurate determination would have to be made using a sample which 
had the same proportion of high, middle and low income children as in 

the total city population. The samples used in this test consisted 

f / 
simply of one classroom in each of the schools from the three incomeX 

levels. If students from low income schools are morfeN^ighly represented 

in the general population than they were in this sample, then the overall 

median rank for any of the tests would probably be lower. 

The socioeconomic makeup of the pupil samples for each test may not 
be similar. Although the selection procedures for selecting the samples ' 
were the same for each test a determination of each child's economic 
status was not attempted. 

The fact that students from high income schools consisteritly scored 
higher than students from middle income schools and that the middle income 
students, in turn, scored consistently higher than the low income students 
gives a strong indi<:ation that whatever is measured by the tests is highly 
corffelated with socioeconomic status. The few teachers who used the tests, 
and the screening committee, reported no obvious evidence of cultural 
bias related to racial characteristics. ' 
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• Screening committee menbers indicated the desirability of using 
the same test (different forms) at the three grade levels. This would 
not be possible if either of the Metropolitan tests were selected. 

Tbe selection of the Metropolitan tests does not appear desirable - 
for other 'reasons. Median percentile ranks for the Minneapolis samples 
appear too high on MET II (65) and too low on MET F (23) at the third 
grade level. The Minneapolis percentile rank on ^ET II was extremely 
low even thorogh the sample of low income pupils was not included! 

Teachers rated several aspects of these two tests as poor (e.g. 
norm information ahd scoring for MET II; practice problems and students 
discouraged on MET F)'. 

MET II and MET F should be eliminated from further consideration 
based on the information obtained from this limited pilot study. 

> A choice between the SEA and the CAT is difficult. Based on pilot 
study information, alone, the SEA appears to be somewhat- more desirable. 
Median percentile ranks for Minneapolis pupils are closer to the 50th 
percentile on the SRA than they are on the CAT. 

Grade 
3 6-8 
■ SRA 58 50 55 
CAT 63 50 38 

These differences may be due to sajj?)ling differences. The low 

Income pupils were not included in the third grade CAT. Their inclusion 

would probably lower the median percentile rank. The low percentile 

4 

rank for the eighth grade CAT may be due to a sampling fluctuation. 
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Teacher lutings provide little credence for making a decision between 
those tiro tests. Only a few teachers were involved and no clear superiority 
for either test was ihdicated. ^ * • 

The very limited ihforraation gained from this pilot study should be 
seen in perspective. We do not know how Minneapolis teacher is will react 
to either of these tests. We have had some screening of the tests by 
a committee representing teachers and we have hc^ a v^ limited ,try-out 
in the field. On the basis of this information either of these two tests 
is worth further consideration. 
I Nevertheless, one can have no strong feeling , of confidence that the 
selection of either of these tests will dispell all the problems enccTuntered 
with previous tests* Nor can we be certain that the selection of either 
of these tests will HecessariOy be the most appropriate tests for our 
purposes. The Stanford Test which was not included in the pilot 'Study, 
because materials could not be obtained in time, could be given further 
consideration if time permits. - However, ^if the selection must be made 
now our best guess is, and its little more than a guess, that the SUA 
Test would be most suitable for Minneapolis purposes. 

The errors observed in teacher scoring my not be indicative of 
errors which would occur once a test is selected for the citywide program. 
All tests in the pilot were new to tjie teachers involved. Staff training 
for test administration was not giv^n* About 30^ of the teachers deviated 
from the manuals* administration instructions. Teachers may not have 
been as careful in test scoring for a pilot study as they would be under 
conditions where the results would directly influence children. 

Nevertheless, the errors made were so peryasive, and of such a 
magnitude, that caution and training will be required regardless of the 
test delected. Machine scoring will help eliminate some errors, but 
errors of jjdministration, interpretation, recording and transferring 
could still occur. 
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Recoranendatiozis . 

1. If a decision m\ist be made now, the SRA test should be selected, 

2. If a decision can be delayed for another year, further study of SRA, 
CAT, and the Stanford should be made. 

3. Additfonal study. If attempted, should include reactions of a larger 

. number of teachers • Each teacher should have the opportunity to rate 
all tests being considered. . - ^ 

If. The Citywide Testing Committee should direct its attention to the 

problem of test scoring. If tests are not accurately scored and recorded 
then it will matter little which test is selected. It appears desirable 
to have tests machine scored at all grades. 

5. The Committee should note three major ass\iii?)tiona underlying this 
pilot test and the resulting recommendations: 

a. Attest in the pilot study would be preferable to the te>t 
^ currently in use.* 

b. It is desirable t%use the same test (different forms) at 

gi*ades 3, 6 and S."^- . ' ^ 

c. Median scores by Minneapolis pupils shoiild approximate the 
median scores in the publisher's norms. 
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- Appendix A - 

Minneapolis Piablxo Schools. 

INSTRUCTIOJIS s " 

Enclosed are the necessary materials for administering the n»the- 
matics section of t o your class; 

copies of the test booklet, answer sheets, and the examiner* a 

maniial and scoring key. 

Please familiarize yourself^ with the directions for administering 
the test -ahead of time, and do not deviate from the directions in the 
manual during the testing session. . ^ 

The tests are to be administered to all students on whichever day 
is most convenient for you during the week of June We would prefer 
that they be given i,ri the morning. Those eighth grade teachers who have 
more than one eighth grade standard math class are requested to administer 
the test to whichever standard class meets during the earliest hour of 
the day. No, basic or enriched math classes are to be involved in this study. 

Eighth grade teachers administering the SRA should give the CONCEPTS 
section one day and the COMPUTATION section on the following day. 

Eighth grade teachers administering the California Achievement Test 

should give the COMPUTATION section on one day and the CONCEPTS and the 

-PROBLEMS sections on the following day. 

Please hand score the tests for your class with the scoring key 

provided. Ultimately, the tests used in the citywide testing program 
will be machine scored^ but we are evaluating the tests* hand scoring 
procedures becaiise the Committee on Citywide Mathembtics Testing felt 
that tefachers should, if they desire, be able to obtain immediate results 
for certain of their students by hand scoring their tests. Please score 
these tests according to the directions provided and ignore any instruc- 
tions which refer to preparation of answer sheets for nachine scoring. 

Scoring the tests for your class should take about an' hour or less of 
your tirne^ You will be reimbursed by the Research Department for this time. 
Please send a service report to the Research and Evaluation Department, 
807 N. E. Broadway, Minneapolis 55^13. Don^t forget to put your address 
on the service report ffq we can mail you a check. 

Return all these testing materials and your signed service report 
to the Research Department by Thursday, June ik. 



' Research and Evaluation. Department 
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Appendix B 
Teacher Questionnaire 
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\ ^ Minneapolis Public Schools 

TEST ' 

SCHOOL 

GRADE 



TEACHER' 



NUMBER. OP CHILDREN TESTED 



Please read the following questions and dheck the appropriate response* 
Peel free to comment in the space provided after each q\jestion. 

1* How well do the items in this t^t reflect the content of the curriculum taught? 

Excellent coverage of curriculum 

G ood coverage of curriculxim* 



JVdeqmte qo'^rage of curriculum 

JPoor covertige' oJC^ curriculum " ^ * Jff^ 
_Very poor coverage of curriculum ^ 



Comments: A * 



— : /{' 



2, How appropriate was the reading difficulty level of this test for yorir class? 
(Third grade teachers: Vere the directions read to the students easy for, them 
Easy enough for every onj^ in the class . to understand?) 



JBasy enough for most of the clas^ 

t 

JSasy enough for about half -of th^?, class 
Tod difficult for most •of the class 



^ T oo difficult for everyone in the class 
Comments 



3. Do you think this test is biased against minority or low income students? - 
Yes 

No • . i - 



Comments : 



Do you think adequate practice problems were provided when necessary? 
Yes 



No 



J/eren*t necesseury 



Comments: 
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,5. 
6. 



^ 9- 

11. 

12. 

13. 
Ik. 



About . ' 

All > Most Hfelf A Few- fene 



For how mny of yotiz*^ studen^js was the 
form or layoutf of the test 'booklet 
confufiixjg or hard to' fallow:? 

For hov many of your students was 
the form or liiyout of the answer 
sheet hard to follo^> (ieavip blank - 
if none were used) 

pow many of your students appeared 
to be motivated and*'inte)^sted in 
doing their best during the ^testing? 

How inany of your students d6 you 
think this test left feeling dis- 
couraged br defeated? 



How many of your studenta did you 
notice engaged in the fdllotdLng 
kinds of activities during the 
testing session? 



Asking questions of the teacher t 
during the timed session 

Talking with other students despite, ^ ' 

instructions to the contrary — „. «^ * 

.Being generally inattentive to * * ^ > 

the" testing--dtaring out the . . 

window, doodling, etc. \ ^ " . 

Crying , ' ^ ^ - - , 

Refusing to take test ^ ' 

Did you find it necessary to deviate at all from the, toanual's directions for test 
administration at any time? ' ^ * 

No . 



Jes If Ye^, why? 



Excellent Good Adequate Poor Very Poor 



15. How would you rate the directions 
fccr administering the test in 
the examiner's manual? 

16. 'How woul^ you rate the adequacy 
of the information presented on 
test interpretaition? 

17. How would you rate the manual's 
discussion of test norms in terms * 
of its helpfulness in interpreting 
results? 

18. How would you rate the ease of the 
hand" scoring procedures presented? 



ERLC 



May 1973 



Research and Evaluation Department 



The particular test which you tried out was one of the possible alternatives 
to the I^TBS Modem Math Supplement. It has not been predetermined that the ITBS 
should necessarily be replaced with another standa3rdized test of this form. 
The K-12 Committee on Citywide I^th Testing is considering all alternatives. 
l56 you personally have any recomnendations a's to what form of assessment would 
be most appropriate? 
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